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ABSTRACT 

We analyze the clustering properties of a high-resolution (~ 10 km/s) sample of about 
1600 Lyman-a lines in the spectra of 15 quasars, obtained in the framework of an ESO 
key-programme with the addition of literature data. The two-point correlation func- 
tion in the velocity space shows a significant signal on small velocity scales (Av & 300 
km/s) with amplitude and significance increasing with increasing column density. The 
correlation scale at z ~ 3 turns out to be 200 — 300/i^q 1 kpc. A trend of increasing corre- 
lation with decreasing redshift is apparent. The existence of over- and under-densities 
on scales of a few tens of Megaparsec is confirmed with a high confidence level and a 
number of possible structures are identified. The present observations are found to be 
consistent with models of gravitationally induced correlations. A continuity scenario 
between Lyman-a and metal systems emerges, with a suggested physical association 
between the Lyman-a clouds with log Nhi ^ 14 and the halos of protogalactic sys- 
tems. 
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1 INTRODUCTION 



The Lyirran=n "forest? of absorption Hues seen in Hie spectra 



of high hedsliiR quasars is generally ascribed Co pre-galailk. 

or proto-galactic material. The study of the spatial distri- 
bution of these clouds offers a powerful tool to gain insight 
in the formation and evolution of structures up to redshifts 
4-5, much larger than those reachable with observations of 
galaxies, limited at z ^ 1. The only other class of objects 
that at present can provide this type of information are the 
QSOs themselves. Lyman-a absorbers have, however, two 
definite advantages with respect to QSOs: they are much 
more numerous and, being less exotic objects, are expected 
to trace more "normal" matter. 

The search for clustering of the Lyman-a lines has pro- 
vided along the years results that have been defined dis- 
appointing or even controversial. Systematic studies of the 
distribution of redshifts in the QSO Lyman-a forest bega n 
in the early 1980's with the work by Sargent et al. (1980), 
which concluded that no structures could be identified. Al- 
most all the subsequent results have failed to detect any sig- 
nific ant correlation on v e locity scales 300 < < 30000 km 



s -1 (Sargent et al. 1982 



Bechtold 1987 



Webb and Barcons 



1991). On smaller scales (Av = 50 — 3 00 km s ) t here have 



been indications of weak clustering (Webb 1987; Rauch et 



al. 1992; Chernomordik 1995; Cristiani et al. 1995), together 



with relevant non-detections flPettini et al. 1990 



and Webb 199£ 



Stenglei 



On the contrary, metal-line systems selected by means 
of the CIV doublet ( [gargent et al. 1988| ) have been early 
recognized to show strong clustering up to 600 km s — 1 , sug- 
gesting a different spatial distribution. In fact, the absence of 
power in the two-point correlation function has been claimed 
as a striking characteristics of the Lyman-a forest and has 
been used as a basic argument to develop a scenario of the 
Lyman-a absorbers as a totally distinct population with re- 
spect to metal systems and therefore galaxies. 



2 THE DATABASE 

The present analysis is based on data obtained in the frame- 
work of an ESO key-programme devoted to the study of 
QSO absorption systems at high redshifts. Up to now spec- 
tra of six QSOs, at resolutions between 9 and 14 km s , 
have been reduced. The list of the objects, with emission red- 
shifts ranging from 3.27 to 4.12, is given in Table 1 together 
with the main characteristics of the sample of Lyman-a ab- 
sorption lines. This constitutes a unique database, especially 
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Table 1. QSO Spectra from the ESO KP 


QSO 




FWHM 


Mag 


No of 


Sample 


Name 




(km s — - 1 ) 




lines 


Limit 












iog(N HI ) 


2126 - 15 


3.27 


11 


V=17.3 


106 


13.3 


2355 + 01 


3.39 


9 


V=17.5 


76 


13.3 


0055 - 26 


3.67 


11 


V=17.5 


187 


13.3 


1208 + 10 


3.82 


9 


V=17.5 


66 


13.3 


1108 - 07 


3.95 


9 


R=18.1 


38 


13.3 


0000 - 26 


4.12 


12 


V=17.5 


181 


13.3 


Other QSO Spectra 


from the literature 






QSO 




FWHM 


Mag 


No of 


Sample 


Name 




(km s —1 ) 


(V) 


lines 


Limit 












log (Nhi ) 


1331 + 17" 


2.10 


18 


16.9 


55 


13.0 


1101 - 26 6 


2.15 


9 


16.0 


37 


13.3 


2206 - 19 c 


2.56 


6 


17.3 


47 


13.3 


1700 + 64 d 


2.72 


15 


16.1 


73 


13.3 


1946 + 76 e 


3.02 


10 


15.8 


72 


13.3 


0636 + 68 f 


3.17 


8 


16.6 


172 


13.0 


0302 - OQf 


3.29 


8 


17.6 


151 


13.0 


0956 + 12 f 


3.30 


8 


17.6 


151 


13.0 


0014 + 81 f 


3.41 


8 


16.5 


164 


13.0 



References: 
a Kulkarni et al. 1995 
b Carswell et al. 1991 
c Rauch et al. 1993 
d Rodriguez-Pascual et al. 1995 
e Fan and Tytler 1994 
f Hu et al. 1995 



at high redshift, where the density of lines provides partic- 
ular sensitivity to any clustering signal. 

Spectra of the QSOs 2126 - 15 8, 0055 - 269, 0000 - 



2619 have been already published (Gjallongo et al. 1993 



Cristiani et al. 1995| ; |Savaglio et al. 1996]) . For 2126-158 new 



spectra have been added, significantly improving the S/N 
with respect to the published data. A complete description 
of the individual spectra will be given elsewhere: here we 
discuss the clustering properties of the Lyman-a sample. 

All the spectra have been analyzed in a uniform way 
and all the lines have been fitted using our FITLYMAN 
program which is now availab le in the ESO-MIDAS package 



(Fontana and Ballester 1995). It performs a x minimiza- 



tion to derive the redshift z, the Doppler parameter b and 
the column density N for isolated lines and individual com- 
ponents of the blends. Whenever possible, the Lyman-/? lines 
have been used to constrain the number of components in 
the strong saturated Lyman-a blends. 

The S/N has been computed from the noise spectrum, 
and is typically greater than 10 per pixel element (corre- 
sponding to 15 per resolution element). In the regions near 
the QSO Lyman-a emissions and for the brighter quasars 
the S/N raises to ~ 20 — 60 per resol ution element . Ex- 



tensive simulations by vari ous authors (Rauch et al. 1993 



derived statistical distributions is expected when the S/N is 
>10. 

To identify heavy element systems we have compared 
the lists of the observed lines with a catalog, derived from 
Morton (1991), containing the most frequently seen lines 
in QSO absorption spectra. As customary, we searched for 
s ignificant exc esses of identifications at all possible redshifts 
( [Bahcall 196^ ). 

We have complemented our data with other spectra 
available in the literature with similar resolution and red- 
shift range, obtaining a final sample of 15 QSOs (in the 
following extended sample, see Table 1). 

All the identified heavy element systems present in the 
Lyman-a forest have been removed from the extended sam- 
ple, which consists of almost 1600 Lyman-a lines with ob- 
served column density log Nhi > 13. 

Lines affected by the proximity effect, i.e. within a 
proper distance of 10 Mpc from the emission redshift of each 
QSO, have been excluded from the clustering analysis. 

We adopt throughout the value Ho = 50 km s _1 Mpc" 1 
for the Hubble parameter and qo = 0.5. 



3 THE STATISTICAL ANALYSIS OF THE 
CLUSTERING 

3.1 Small scales 

As a first statistical tool to study the clustering properties of 
our sample of Lyman-a lines we have adopted the two-point 
correlation function (TPCF), defined as the excess, due to 
clustering, of the probability dP of finding a Lyman— a cloud 
in a volume dV at a distance r from another cloud: 



dP = $ Lya (z)dV[l + Z{r 



(1) 



where &(z) is the average space density of the clouds as a 
function of z. The TPCF is known to be a satisfactory es- 
timator when used to investigate weak clustering on scales 
considerably smaller than the total interval covered by the 
data, which is the first domain we want to explore. The bin- 
ning, intrinsic to this method, causes a loss of information, 
but the ease in visualizing its results and in including ob- 
servational effects in the computing codes have made of the 
TPCF one of the favorite statistical estimators in cosmology. 

In practice the observations provide the redshifts of the 
Lyman-a lines that, due to peculiar motions, are not im- 
mediately transformed in comoving distances. Therefore it 
is normally preferre d to compute the TPCF in the velocity 
space, estimated as ( Peebles 198C ) 

N obs (Av) 



£(Ad) = 



N exp (Av) 



(2) 



where N b s is the observed number of line pairs with velocity 
separations between Av and Av + e v and N exp is the num- 
ber of pairs expected in the same interval from a random 
distribution in redshift. 

At the small velocity separations we are dealing with, 
the variation of the distance scale with cosmic time can be 
neglected and the velocity di fference can be simp ly deduced 
from the redshift difference (Sargent et al. 1980) 



Fontana and Ballester 1995) show that no strong bias in the 



Av 



C (Z2 — Zl) 

l + (zi+z 2 )/2 



(3) 
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Figure 1. Two-point correlation function in the velocity space: a) 
(upper panel) for the complete sample of Lyman-a lines, b) (lower 
panel) for lines with column densities < 10 13,6 cm -2 . The short- 
dashed and long-dashed lines represent the la and 2a confidence 
limits for a Poissonian process 



where Av is the velocity of one cloud as measured by an 
observer in the rest-frame of the other. 

In our line sample N exp is obtained averaging 1000 nu- 
merical simulations of the observed number of redshifts, try- 
ing to account for all the relevant cosmological and observa- 
tional effects. In particular the set of redshifts is randomly 
generated in the same redshift interval as the data according 
to the cosmological distribution oc (1 + z) 7 , where the best 
value of 7 = 2.65 has been derived from a maximum likeli- 



hood analysis of the same database (Gliallongo et al. 1996). 
The results are not sensitive to the value of 7 adopted and 
even a flat distribution (i.e. 7 = 0) gives values of £ that dif- 
fer typically by less than 0.02. Incomplete wavelength cover- 
age due to gaps in the spectrum or line blanketing of weak 
lines due to strong complexes is also accounted for. Lines 
with too small velocity splittings, compared with the finite 
resolution or the intrinsic blending due to th e typical line 
widths - the so called "line-blanketing" effect (Gliallongo et 



al. 199^) -, are excluded in the estimate of N e . 

The resulting correlation function for the full extended 
sample of Lyman— a lines is shown in the upper panel of Fig. 
1. A weak but significant signal is present with £ ~ 0.2 in the 
100 km s" 1 bin: 739 pairs are observed while only 624 are 
expected for a random distribution, a 4.6a deviation from 
poissonianity. 




1000 2000 

Av (km s" 1 ) 



3000 



Figure 2. Two-point correlation function in the velocity space 
for lines with column densities > 10 138 cm -2 . Confidence limits 
as in Fig. 1. The continuous line shows the model described in 
Section 4, eq. 4, with 7 = 1.77, a = 50 km/s, r c i 
and ro = 250h.7 o 1 kpc at 2 = 3. 



As in Cristiani et al. (1995), we have explored the varia- 
tions of the clustering as a function of the column density. In 
Fig. lb the correlation function for lines with log Nhi < 13.6 
is shown. All the evidence for clustering has disappeared. 
On the contrary, for lines with log Nhi 13.8 (Fig. ^), the 
correlation function at Av = 100 km s _1 shows a remark- 
able increase in amplitude (£ ~ 0.6) and significance: 234 
pairs are observed while only 145 are expected for a random 
distribution, a more than 7er deviation from poissonianity. 
No relevant feature other than the peak at small velocity 
separations is observed. In particular the evidence for anti- 
clustering on scales ~ 600 — 100 km s _1 , previously sug- 



gested by Meiksin and Bouchet ( 1995 ) on the basis of data 
of 0055 - 259 and 0014 + 813, is not confirmed. It should be 
noted that the assessment of the significance of a deviation 
from poissonianity is conceptually different when a velocity 
bin given a priori (as in the analysis of small-scale cluster- 
ing) is analyzed, with respect to the case in which a deviation 
in any of the bins is searched for. While in the former situa- 
tion the confidence levels shown in Figg. 1,2,4 (obtained by 
comparing the number of pairs observed in each bin with the 
distributions expected from simulations) are perfectly valid, 
in the latter case the significance of a positive or negative 
deviation in any of the bins is considerably less than what 
a naive comparison would at first glance suggest. 

Fig. ^ shows the variation of the amplitude of the two- 
point correlation as a function of the column density thresh- 
old for the sample of the Lyman-a lines. On the same plot 
are reported for comparison the TPCF's derived for high and 
low column density CIV meta l systems from the works by 
Petitj ean and Bergeron (1994) and by Songaila and Cowie 
( 1996) ), respectively. A reference value of log(iVffr) ~ 17T0.5 
has been assigned to high column density CIV metal systems 
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15 16 
log (N HI ) 

Figure 3. Evolution of the amplitude of the two-point correlation 
function as a function of the column density threshold for the 
sample of the Lyman-a lines. The two points in the upper-right 
side of the picture show the correlation of the CIV metal systems 
(see text) 



on the basis of observations of 0420-388 ( Atwood et_aL_T985| 
Petitjean ano^I3cr^crori^l^994 ) and of 2 126-158 (Giallongo et 
al. 199^; Petitjean and Bergeron 1994). For the low column 
density CIV systems a value of l og (JVh j) ~ 15.5 ± 0.6 is 
derived from Songaila and Cowie (1996). 

An extrapolation of the increasing amplitude trend ob- 
served for the TPCF of the Lyman-Q lines would easily in- 
tercept the corresponding estimates derived from the CIV 
metal systems. We stress also the similarity between the 
shapes of the TPCF's of Lyman-a and CIV systems, when 
observed at comparable resolution (cfr. Fig. 4b of Petitjean 
and Bergeron 1994, Womble et al. 1995, Fig. 6 of Songaila 
and Cowie 1996) and the fact that a trend of increasing cor- 
relation with increasing column density is observed also in 
CIV systems, for which the strong systems appear usually in 
clumps of many components, while the weak doublets (gen- 
erally optically thin at the Lyma n limit) are typi cally made 
of an isolated, single component (Bergeron 1995). 

In a recent paper, Fernandez- Soto et al. (1996) have in- 
vestigated the clustering properties of the Lyman-a clouds 
on the basis of the corresponding CIV absorptions, suggest- 
ing that CIV may resolve better the small-scale velocity 
structure that cannot be fully traced by Lyman-a lines. As 
a consequence, the estimates of the TPCF of the Lyman-a 
absorbers, although the effects of the non-negligible width of 
the lines (the "line-blanketing") are considered in our sim- 
ulations, should be regarded as a lower limit to the real 
clustering amplitude. However, it is not straightforward to 
translate the properties of the CIV absorbers in the cor- 
responding ones of the Lyman-a absorbers, since observa- 
tions (e.g. Prochaska and Wolfe 1996) show that the veloc- 
ity structures of high and low-ionization species are often 
different. Since the underestimation of the Lyman-a TPCF 
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Figure 4. Evolution of the two-point correlation function with 
redshift for the sample of the Lyman-a lines with column densities 
above log(7Vjf/) = 13.8. The three redshift intervals 1.7 < z < 
3.1, 3.1 < z < 3.7, 3.7 < z < 4.0 (upper, middle and lower panel, 
respectively) have been chosen in a way that the confidence limits, 
estimated as in Fig. 2-3, turn out to be roughly the same in all 
three ranges 



would be more severe at larger column densities, the trend 
of an increasing correlation with increasing column density 
appears to be real, from the lowest column densities up to 
those corresponding to the strongest metal systems. 

In Fig. W we study the evolution of the TPCF with the 
redshift for the sub-sample of Lyman-a lines with column 
densities log(NHi) > 13.8. The amplitude of the correla- 
tion at 100 km s~ decreases with increasing redshift from 
0.85 ± 0.14 at 1.7 < z < 3.1, to 0.74 ± 0.14 at 3.1 < z < 3.7 
and 0.21 ± 0.14 at 3.7 < z < 4.0. Unfortunately, HST data 
are still at t oo low-resolution (e .g. Bahcall et al. 1995) or are 
too scanty (Brandt et al. 1993) to allow a meaningful com- 



parison with the present data. Nonetheless, the existence at 
low-redshifts of clumps of Lyman-a absorption lines clus- 
tered around metal-line systems, reported by Bahcall et al. 
(1995) as a resemblance with the properties of galaxies and 
of metal absorption systems, is suggestive that the trend il- 
lustrated in Fig. ^ persists at lower redshifts, as confirmed by 
the measure of the Lyman-a TPC F, £ = on scales 



of 200 - 500 km/s at < z < 1.3 fluimer 199rj ) 
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Table 2. QSO Spectra used in the search for voids 



QSO 


Zern 


minimum 


Number of 


Name 




void size 


detected 








LA / I IVipC J 




0636 4 


68 


3.17 


15.9 


1 


2126 — 


15 


3.27 


24.7 





0302 — 


00 


3.29 


17.5 





0956 4 


12 


3.30 


16.9 





2355 4 


JXL- 


3 39 


13 1 


n 


0014 4 


81 


3.41 


15.8 





0055 - 


26 


O.O/ 


1 A Q 


o 
Z 


1208 4 


10 


3.82 


8.5 





0000 - 


26 


4.12 


6.9 


1 



3.2 Large scales 

The TPCF is known to be inefficient for detecting struc- 
tures on scales comparable with respect to the dimension 
of the sample (e.g. Pando and Fang 1996). For this reason 
to investigate the clustering properties of the Lyman-a ab- 
sorbers on scales of several comoving megaparsecs, searching 
for over- and under-densities of lines significantly deviating 
from a Poissonian distribution, we have resorted to different 
statistical techniques. The aim was not only to assess the ex- 
istence of non-random fluctuations on large scales, but also 
to identify the regions of the spectra giving origin to such 
deviations. 



3.2.1 Voids 

A typical way of looking for non-random fluctuations of the 
line density on large scales is the search for voids. Voids in 
the Lyman-a forest provide a test for models of the large- 
scale structure in the Universe and of the homogeneity of 
the UV ionizing flux. Previous searc hes for megaparsec-sized 
voids have produced a few claims ( Crotts 1989] ; Dobrzycki 
and Bechtold 1991t), but uncertainties in the line statistics 



strongly influence the probability estimate ( Dstrikcr 1988 ) . 
High resolution data, less affected by blending effects, are 
ideal also for the study of voids, but great care has to be 
taken in the statistical approach to avoid the pitfalls of "a 
posteriori statistics" . Detections of voids in individual cases 
are interesting but a more general approach, assessing more 
quantitatively how common is the phenomenon, is to be 
preferred. Differences in the redshifts and S/N (and con- 
sequently in the line density) make difficult a global and 
uniform usage of our extended sample in this sense. There- 
fore we have designed the following procedure: for each of the 
objects listed in Table |l| and for lines with \og{Nui) > 13.3 
we have estimated through Montecarlo simulations the void 
size Ar (in comoving coordinates) for which the probabil- 
ity to find at least one void > Ar is 0.05. The objects for 
which the density of lines and the absence of spectral gaps 
allow a meaningful analysis are listed in Table |^. Then we 
have searched each spectrum for voids of dimensions equal 
or larger than Ar. In 3 out of 9 cases at least one void was 
detected (two voids for 0055 — 26). The binomial probability 
corresponding to such occurrences is 8 • 10 -3 . 

The four voids reported in Table fca have been found in 



the spectra of 06364-68 at 4339.5-4361.6 A (16.1 Mpc size), 
0055 - 26 at 4979.05 - 5016.6 and 5188.4 - 5223.8 A (22.3 
and 19.8 Mpc, respectively) and 0000 - 26 at 5627 - 5644 
A (8.5 Mpc). The well known void at 5050 - 5100 A in the 
spectrum of 0302 - 00 ( [Dobrzycki and Bechtold 1991h| ) has 
not been found because the spectrum of Hu et al. (1995) 
does not cover this region of the Lyman forest. From the 
published spectra and from Fig. 5 it is apparent that the 
regions corresponding to the voids are not completely devoid 
of lines: weak absorptions are observed within the voids. 



This agrees w ith low-redshift observations (Shull, Stocke and 
Penton 1996) that have shown that in the local Universe 
voids are not entirely devoid of matter. 

Even if underdense regions are statistically significant in 
our sample, the filling factor is rather low: the voids reported 
in Table ^ cover only about 2% of the available line-of-sight 
path-length, confirming what already noticed by Carswell 
and Rees (1987). 



3.2.2 Over- and under-densities of lines 

Voids are just an extreme case of spectral regions showing an 
under-density of lines. Various theoretical reasons prompt to 
tackle the issue of over- and under-densities of lines from a 
more general point of view: the typical signature of a "prox- 
imity effect" due to a foreground quasar is a lack of weak 
l ines in the Lyman forest, rather than lack of lines in general 
(Dobrzycki and Bechtold 1991a); the relative filling factor of 
under-densities and over-densities may provide a constraint 
on the theories of structure formation. 

We have analysed the spectra of the quasars reported 
in Tab. [ljwith a counts-in-cells technique searching for over- 
and under-densities of lines with log(Nm) > 13.3 on scales 
from 10 to 80 Mpc and comparing the observed counts with 
Montecarlo simulations in order to assess the significance 
of the deviations. On smaller scales the shot noise is too 
large, on larger scales the "integral constraint", forcing the 
simulated number of lines to be equal to the observed one, 
prevents from the possibility of detecting any deviation. The 
threshold to define significant a deviation (for example in 
excess) in a given spectral interval has been set in a way that 
at a given scale for a given quasar there is a 0.05 probability 
of observing at least one deviation of this type on the whole 
spectrum for a locally Poissonian distribution. In Table ^ we 
report the results of the analysis. An asterisk denotes that 
a meaningful search at that particular scale has not been 
possible for the given object. 

5 QSOs out of 15 show at least one over-density in their 
spectrum at 10 Mpc scales, corresponding to a binomial 
probability of 6 ■ 10~ 4 of being drawn from a poissonian 
distribution of lines. 4 QSOs show at least one over-density 
of lines at 20 and 30 Mpc scales, corresponding to a binomial 
probability of 5 • 10~ 3 . At larger scales the number of signif- 
icant over-densities decreases. None is observed at 80 Mpc. 
Over-densities detected at one scale tend to persist at larger 
scales, analogously to what observed in the wavelet identifi- 



cation of st ructures from galaxy counts ( 31ezak,Bijaoui and 
Mars 1990b. 



Under-densities appear to be roughly as common as 
over-densities, once the lack of sensitivity at lower redshifts 
and smaller scales, due to the low density of lines, is taken 
into account. In practice in Table ^ the voids of Table ^ 
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Figure 5. Over and under-densities of lines at a scale of 20 Mpc in the spectrum of Q0055 — 26. The positions of two voids, at 5000 
and 5200 A respectively (see Tab. 2), are shown together with and over-density at 5080 A. The relatively low resolution of the plot does 
not allow to fully visualize the significance of the over-density: 15 lines with log(Nm) > 13.3 are observed while only 7 are expected. 



Table 3. Over-densities of lines found in QSO spectra 



Table 4. Under-densities of lines found in QSO spectra 



Scale 


10 Mpc 


20 Mpc 


30 Mpc 40 Mpc 


80 Mpc 


Scale 


10 Mpc 


20 Mpc 


30 Mpc 


40 Mpc 


80 Mpc 


QSO 




over 


density center 




QSO 




undcr-density center 




Name 






(A) 




Name 






(A) 






1331 + 17 


3494 


3493 


3490 3490 




1331 + 17 


* 


* 


3707 








3605 


3609 






1101 - 26 


* 


* 


3679 




3670 


1101 - 26 


3742 




3743 




2206 - 19 


* 


4238 








2206 - 19 










1700 + 64 


* 










1700 + 64 










1946 + 76 


* 






4799 




1946 + 76 


4427 


4435 






0636 + 68 


* 










0636 + 68 










2126 - 15 


* 










2126 - 15 










0302 - 00 


* 










0302 - 00 






4696 




0956 + 12 


* 










0956 + 12 










2355 + 01 


* 










2355 + 01 




5217 






0014 + 81 


* 










0014 + 81 










0055 - 26 


5263 


5001 


5007 


4988 




0055 - 26 


5085 


5080 










5208 


5406 
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are recovered, except for the void of 0636 + 68, the least 
significant case. 



4 DISCUSSION 

As shown by Fernandez-Soto et al. (1996), a TPCF approach 
applied - as in the present work - directly to the Lyman- 
a lines tends to underestimate their clustering properties. 
However, the very fact that a signal is found, in spite of all 
the limitations described by Fernandez-Soto et al. (1996), is 
a significant result and, even if regarded as lower limits, the 



measured clustering amplitudes allow to draw interesting 
inferences. 

In this way, the observed clustering properties are quali- 
tatively consistent with a scenario of gravitationally induced 
correlations. The dependence of the clustering amplitude 
on the column density, reported in subsection 3.1, is eas- 
ily explained by models involving biasing for the formation 
of structures in the universe. Objects associated with the 
stronger potential wells are expected to be more clustered 
and the HI column density is naturally related to the depth 
of the well (and to the associated mass). 

The trend of increasing correlation with decreasing red- 
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shift (Fig. g|) is also a strong prediction of any model of 
structure formation based on gravitational instabilit y. On 



the contrary, theories of explosive structure formation (Vish 



the cloud proper sizes of the order of 200h^ kpc. It is in- 
teresting to compare this upper limit with the lower lim- 
its on the cloud dimensions inferred from the observations 



niac, C striker and Bertschinger 1985 ; Weinberg, Ostrikei 
and Dc kel 19881)" expect a velocity correlation either un 
changir g or diminishing with time. 



of QSO pairs or gravitational lenses ( Smette et al. 1995 



Smette et al. 1995 



Bechtold et al. 1993 ; pinshaw et al 



1995), r c i 100/i 50 kpc. Indeed, the trend observed in Fig. 



The existence of a roughly equal number of over and 
under-densities on scales 10 — 40 Mpc is easily understand- 
able in terms of the linear theory of the evolution of the 
perturbations, that is a plausible approximation at such rel- 
atively high redshifts: gravity has not yet had time to give a 
significant skewness to the (under) over-density distribution. 
Besides, almost any hierarchical clustering scenario would 
expect that at z ~ 2 — 4 gravity has not yet had enough 
time to transfer power on 80 Mpc scales and give origin to 
significant over or under-densities. 

Numerical simulations in the framework of a CDM 
model, including photo-ionization and cooling of the bary- 
onic component, have reproduced to a remarkable extent the 
observed correlation functio n and also its depend ence on the 
column density of the lines ( Mxicket et al. 1996 \. 



|4| is consistent, in the framework of a simple hierarchical 
clustering model predicting that r scales with the redshift 
according to the relation r (z) = r o (0)(l + z)~ 5 ^ 3 , with large 
cloud sizes (r c i 200/i^q 1 kpc) and low velocity dispersions 
(few tens km/s). 

Assuming an index 7 = 1.77 for the power-law spatial 
correlation function as commonly measured for galaxies, rel- 
atively low velocity dispersions (< 100 km/s) as suggested 
by observations of quasar pairs (e.g. Dinshaw et al. 1994) 
and cloud sizes of 200/i^q 1 kpc, implies, to reproduce the ob- 
served correlation in the velocity space, a correlation scale at 
z ~ 3 of 200 — 300/i^q 1 comoving kpc. This amplitude extrap- 
olates, according to the (1 + z)~ 5 ^ 3 scaling, to 2 — Mpc 
at the present epoch, to be compared with the values 10 and 
7.5fe^ n 1 Mpc observed for present day optical (|ie Lapparcnt 



In this way the observed clustering properties (in par- Gcllcr and Huchra 1988 ) and IRAS (Fisher, Davis, Straus; 



he correlation between clustering amplitude and col- et al. 1994 ) galaxies, respectively. 



ticular 

umn density) become part of a new continuity scenario that 
rests on two other pieces of evidence: the metallicities of the 
order 10~ 2 observed for Lyman-a clou ds with log Nhi > 14 



(Cowie et al. 1995; Tytler et al. 1995) and the volume den- 
sity an d cosmological evolution of the same clouds found 



A spatial correlation function with a correlation scale 
r (z = 0) ~ 3 Mpc, convolved with a velocity dispersion 
a ~ 50 km/s, would result in a correlation function in the ve- 
locity space non d issimilar with respec t to what is observed 
for C IV systems (Wbmble et al. 1995; Songaila and Cowie 



to be similar to those of the damped systems (Giallongo et 1996) if a cloud radius ~ 15/i 50 proper kpc is assumed. Such 



al. 199( p . All together this suggests a physical association 
between the Lyman-a clouds with log JV777 > 14 and the 
halos of protogalactic systems. At low redshift a consider- 
able fraction of the Lyman-a lines has been observed to be 
associated with luminous galaxies and the local large-scale 



structu re ( Lanzetta, Bowen, Tytler et al. 1995 
Berger p^Tl996| ) 



Le Brun and 



The typical column density below which no clustering of 
the Lyman-a lines is observed (log(NHi) ~ 13.6, Fig. |l|) cor- 
responds to t he position of the brea k in their column density 
distribution (Giallongo et al. 1996; Hu et al. 1995[ ), which 



has been identified with the transition from a variety of sys- 
tems in various stages of gravitational infall and collapse 
(or even und er-densities) to gas associated with star form- 



a value appears considerably smaller than what is commonly 
inferred for galactic halos giving origin to metal-rich absorp- 
tion systems (r ~ 75/iJ kpc, Bergeron 1995), but certainly 
at least part of the small-scale correlation observed for CIV 
systems is due to the motion of clouds of size << 75 kpc 
within individual galaxy halos. The observed correlations in 
the velocity space might then arise from the same spatial 
correlation function both for Lyman-a and metal systems, 
with different degrees of amplitude suppression due to the 
different sizes of the regions giving origin to the Lyman-a 
and metal absorption, a well-known phenomenon occurring 
when the cloud si zes are non- n egligible with respect to the 
correlation scale (Bajtlik 1995). 



Miicket et 



ing galaxies (Zhang, Anninos and Norman 1996 
al. 199^ 

Following Heisler, Hogan and White (1989) and Cris- 
tiani et al. (1995), it is possible to derive the three- 
dimensional spatial autocorrelation function of the clouds 
£ r , from the velocity correlation £„, assuming a Gaussian 
distribution of the peculiar motions of the clouds with re- 
spect to the Hubble flow and a power-law form for the spatial 
correlation function, as observed for galaxies. 



x < exp 



Hdr £(r) P(v \ r) oc 



Hdr 



Idr ( r V 
o \r J 



(Hr 



2a 2 



+ exp 



(Hr + v) 2 
2a 2 



(4) 



At small velocity splittings, the correlation scale r de- 
pends mainly on the cloud sizes r c i and on the velocity dis- 
persion a assumed. The very fact that a significant corre- 
lation in velocity is observed results in an upper limit on 



5 CONCLUSIONS 

The analysis of a large database (~ 1600 lines) of Lyman-a 
absorptions observed at ~ 30000 resolution in the spectra of 
15 quasars has produced the following results: 

(i) clustering on small velocity scales (Av ^ 300 km/s) 
has been measured at a 5a confidence level for the full 
line sample, which has typically log (JVhj) <; 13, with 
6,(100 km/s) = 0.2 

(ii) the clustering amplitude increases with increasing col- 
umn density of the lines: £ v (100 km/s,\og(NHi) > 13.8) = 
0.6. No significant clustering is observed for lines with 
log (JVh 7) 13.6. An extrapolation of this trend to column 
densities typical of metal systems is consistent with the clus- 
tering observed for CIV lines 

(iii) a trend of decreasing clustering with increasing red- 
shift is observed 
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(iv) a number of over- and under-densities of lines (in- 
cluding 4 voids) have been identified on scales of a few tens 
of Mpc, confirming at a high confidence level the reality of 
deviations from a poissonian distribution of lines on such 
relatively large scales. The filling factor of such structures 
is however low: a few percent of the total line-of-sight path- 
length. 

Although the TPCF approach adopted in the present pa- 
per tends to underestimate the small-scale clustering ampli- 
tudes, whose measured values have to be regarded as lower 
limits, the following conclusions can be drawn: 

(i) the observed clustering properties appear qualitatively 
consistent with a scenario of gravitationally induced corre- 
lations 

(ii) simple considerations about the two-point correlation 
function in real and velocity space allow to deduce an up- 
per limit to the cloud sizes r c ; Ss 200/i^j 1 proper kpc and a 
correlation scale r — 200 — SOOh^f comoving kpc at z ~ 3. 

(iii) a continuity scenario between Lyman-a and metal 
systems is suggested, with a physical association between 
the Lyman-a clouds with \o%Nhi <; 14 and the halos of 
protogalactic systems. 
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